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ABSTRACT 

This paper presents a new Compressive Sensing (CS) scheme 
for detecting network congested Hnks. We focus on de- 
creasing the required number of measurements to detect all 
congested links in the context of network tomography. We 
have expanded the LASSO objective function by adding a 
new term corresponding to the prior knowledge based on 
the relationship between the congested links and the corre- 
sponding link Betweenness Centrality (BC). The accuracy 
of the proposed model is verified by simulations on two real 
datasets. The results demonstrate that our model outper- 
formed the state-of-the-art CS based method with significant 
improvements in terms of F- Score. 

Index Terms — Network tomography, Compressive sens- 
ing, Congestion detection. Prior knowledge. 

1. INTRODUCTION 

The network tomography scheme is a method to calculate link 
variables such as delay or bandwidth, using the end-to-end 
measurements. In this paper, we intend to solve a congestion 
detection problem using the network tomography scheme, 
based on end-to-end measurements. Our goal is to decrease 
the number of required measurements in order to identify all 
congested links. Since only a small number of network links 
may become congested, it is assumed that the delay vector 
(which contains delays of all links) is sparse. 

To solve this problem, we employ Compressive Sensing 
(CS) Q which has received significant attention in the recent 
years. Since its inception, CS shows remarkable results in re- 
constructing a high dimensional sparse signal with a small set 
of measurements. A significant issue in this field is the sample 
complexity which is the number of required measurements 
for high quality signal recovery which is solvable through 
LASSO d. Although CS is a new sampling scheme in signal 
processing, it has been applied to many network applications 
specially in the area of network tomography. Fault diagnosis 
II2II . traffic estimation fSl, data localization JH, and conges- 
tion detection |5 | are some well known applications in this 
field. 

The state-of-the-art CS-based algorithm in network to- 
mography [5], used random walks to gather end-to-end de- 



lays and employed LASSO in its model for congestion detec- 
tion. However, it has two major drawbacks; relatively high 
number of random walk measurements (much higher than the 
total network links), and low accuracy in detecting the con- 
gested links when the required number of measurements is 
small. In this paper, we try to solve those drawbacks by using 
the prior knowledge corresponding to the correlation of link 
betweenness centrality (the number of shortest paths that tra- 
verse from the link) and link congestion. We also introduce 
a new CS objective function in which we consider the depen- 
dency of congestion and betweenness centrality. We further 
show that the proposed objective function can be considered 
as the Elastic-Net model (H, which is a more stable alterna- 
tive to LASSO. We have simulated the proposed Compres- 
sive Sensing in Congestion Detection (CSCD) model, by us- 
ing real data with various configurations. We have compared 
the F-Score (of the retrieved congested links) in our model 
with 0, in terms of the number of measurements (random 
walks) and various sparsity of the network delays in which 
we achieve significant improvements. 

The rest of the paper is organized as follow; in Section |2] 
we describe the related work in network tomography, conges- 
tion detection, and compressive sensing. Section [3] shows an 
introduction to compressive sensing and the problem we try 
to solve in this paper. In Section (H we introduce the proposed 
compressive sensing model. Comparison with the previous 
model is appeared in Section [5] Finally, we conclude the pa- 
per in Section [6l 

2. RELATED WORK 

Compressive sensing in network tomography is first discussed 
in . The authors discussed a group testing problem on the 
Erdos-Renyi random graphs. They have applied OR opera- 
tion (instead of summation) on the gathered measurements 
and calculated the required number of measurements when 
the link variables are binary. In [5|, the authors gathered 
end-to-end delay information by applying random walks to 
the network and recovered the sparse delay values of net- 
work links using compressive sensing. Although they have 
better theoretical order for the number of measurements com- 
pared to [7 1, they have used high number of random walks to 
achieve good recovery percentage in practice. In their theo- 



retical proof, they required that the graph is highly connected 
which may not be true in all of the real networks. Moreover, 
their practical results are accomplished according to specific 
graphs such as the complete graph. 

In our model, however, we try to solve these practical is- 
sues by employing the relationship between link betweenness 
centrality and link congestion in the network. As noted in ||8l, 
link betweenness centrality and congestion are two measure- 
ments in the network which are highly correlated. In JH], (9], 
and ifTOl link betweenness centrality is used as the only ele- 
ment to detect the congested links in the network. Although 
betweenness centrality is effective to achieve this purpose, 
none of these papers consider end-to-end measurements in 
finding the congested links. Later in this paper, we show how 
efficient it is to employ both link betweenness centrality and 
end-to-end measurement for congestion detection. 

3. PROBLEM DEFINITION 

We consider a real communication network as an undirected 
graph with the set of links E, and vertices V. We assume 
each link can transfer data in both directions between two 
connected nodes. We are going to measure the network con- 
gested links using end-to-end delays of M random walk paths 
in the network (M < |£^|). Thus, we should first measure the 
delay of each network link. Let the vector y m x i denote the 
observed end-to-end delays where represents the end-to- 
end delay of the path i. We also define xatxi as the links' 
delay vector where Xj represents the delay of link j. Since 
the number of congested links in a network are sparse, x is 
considered as a /c-sparse vector (k <^ N and TV = |£^|). The 
goal is to recover vector xatxi which contains the delay of 
all N links of the network. Moreover, by recovering x, the 
congested links can be recognized through their high amount 
of delay. To recover this vector, compressive sensing method 
has been used. The recovery process starts with solving the 
following equation: 

y = Ax + e (1) 

where e is the noise vector that is experienced in the random 
walk measurements, and A is an M x TV matrix. The ele- 
ments of i^^ row and j^^ column of A is equal to 1, if the 
i^^ link of the network is available in the j^^ random walk 
path. We employ M random walks over the network graph 
and gather information regarding the end-to-end delay of each 
path (ji). Having A and y, we may reconstruct x by minimiz- 
ing ||x||o under the constraint of Eq. ([TJ. However, as stated 
in (HI, this is an NP-hard optimization problem. In com- 
pressive sensing, the aforementioned optimization problem is 
solved by minimizing the ||x||i instead of ||x||o which results 
in an optimization problem with computational complexity of 
0{N^) O, known as LASSO: 

x = argmin (A||x||i + 7|| Ax - y ||^) (2) 

X 



4. THE PROPOSED MODEL 

We have employed the Bayesian theory in our model which 
is used in many compressive sensing applications |[T2l |[T3]| 
lfT4l . The previous applications, however, have not consid- 
ered prior knowledge on x other than its sparsity. The main 
goal of our model is congestion detection with a few number 
of random walk measurements. We have employed the be- 
tweenness centrality |15 | of the congested links as the prior 
knowledge in ^^-norm minimization. 

Let D denote the maximum delay that each link can tol- 
erate in the network and 6^, < bi < B, denote the link 
betweenness of the link i where B is the maximum link be- 
tweenness centrality in the network. We use linear interpola- 
tion to capture the relation between the betweenness centrality 
and the prior belief about the delay values of network links. 
In this way, we get the scaled version of link betweenness 
centralities (s). 

With a high probability, a link with the maximum link 
betweenness (B) has the highest delay (D) in the network 
|[T6l . Moreover, a link with the lowest link betweenness (0) is 
more likely to have no delay. Thus, (0,0) and (B,D) are two 
points on the interpolating line. Therefore, by considering the 
link betweenness values as the X-axis (bi) and our prior be- 
lief about links' delay as the Y-axis (si) in a two dimensional 
space. Si is given by: 

s^=Dx^ (3) 

In order to recover x from y, it is critical that x be sparse. 
Since s is also a sparse vector (because it is highly correlated 
with x), X — s should be sparse too. Therefore, we define the 
probability density function of x as follows: 

where /ci,/c2 G M^. It penalizes non-sparse choices of x — s 
(by the first term) and also vectors x which are not similar to 
s (by the second term). By observing y, we intend to find x 
with the highest probability. Thus, we may maximize P(x|y) 
by using the Maximum a Posteriori probability (MAP) esti- 
mation as follows: 

Since the goal of Eq. (O is to find the maximum value of x 
in P(x|y), by eliminating the terms that do not depend on x, 
and taking logarithm on both sides, we obtain: 

max log(P(y|x) P(x)) = max (logP(y|x) +logP(x)) 

X X 

(6) 

On the other hand, by observing y, we intend to find x from 
Eq. ([T]). We assume e has a normal distribution with zero 



mean and covariance matrix cr'^I. Therefore: 
P(y|x)^A/'(Ax,a2/) 



(7) 



According to Eq.s ^ and (|7]), Eq. © may be expanded as 
follows: 



max (logP(y|x) + logP(x)) = 

X 

'exp(-(y-Ax)T^/(y-Ax)) 



log < 



log < exp 
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where C G M and M is the number of random walk measure- 
ments. Therefore, the optimization problem becomes: 
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(8) 

where 7 = A = and a = An important goal 
in CS is to have model consistency O, which shows that the 
support of recovered x converges to the support of original x 
as the number of random walk measurements goes to 00. To 
verify this property, we show that our problem is similar to 
the one discussed in which is known as Elastic-Net: 



f3 = arg nun 



(A||/3||i+7P/3-/ll 



(9) 



To increase the recovery accuracy, achieve model consistency, 
and overcome LASSO limitations (61, Elastic-Net is used as 
an alternative to LASSO. 

Consider /3 = x — s and y' = As — y. Since s is a 
constant matrix, it is easy to show that Eq. ([S]) and Eq. (O are 
two equivalent optimizations. Thus, our model is in the form 
of Elastic-Net. 

As mentioned before, we have M random walk measure- 
ments, N network links and k is the sparsity value. Without 
loss of generality, assume that only the first k elements of x 
are non-zero. Then xi = (xi, x^), X2 = (x/c+i, xat), 
^4(1) contains the first k columns of A, and ^4(2) contains the 
last N — k columns of A. 

Therefore, by having Cii = -^Aj^^A(^i^,Ci2 



where 77 > 0. In the next section, we show that in our model 
EIC is satisfied with a higher probability compared to IC. 

5. EVALUATION 

5.1. Simulation Framework 

In order to evaluate the proposed model, we have performed 
extensive simulations (30 runs) in MATLAB. We have used 
two real datasets. The first dataset contains the information of 
a mobile operator which has 273 links and 158 nodes corre- 
sponding to the network devices in Mobile Switching Centers 
(MSG) placed in 40 cities. The second dataset contains the 
information of a data network with 366 links and 277 nodes 
which are located in more than 50 cities. We have also consid- 
ered the variance of the results by measuring the related error 
bars. For the first dataset, we consider A = 10~^,a = 10~^, 
and 7 = 1. For the second dataset we have A = a = 7 = 1. 
Through several simulations, these are almost the best con- 
figurations for both LASSO and our model. The number of 
steps in each random walk is assumed to be 15. 

5.2. Validation 

To evaluate the proposed model, named Compressive Sensing 
in Congestion Detection (CSCD), we have used F-Score mea- 
sure which corresponds to the harmonic mean of precision 
and recall. Precision measures the percentage of correctly de- 
tected congested links to the sum of correctly and wrongly 
detected congested links, and recall refers to the percentage 
of correctly detected congested links to the total detected con- 
gested links. 

First, we have evaluated the proposed CSCD model and 
LASSO through model consistency. Assuming r] = 0.01, EIC 
in Eq. ([TT]) and IC in Eq. ([TT]) are verified for our model and 
LASSO, respectively. At the end of all 30 simulation runs, 
the percentage of the times that EIC and IC are satisfied, is 
computed. It has to be mentioned that CS-over-Graphs in B 
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-^A^2)^(2)' and C21 = ^A'J^2)^(i)' Irrepresentable 
Condition (IC) can be shown to be a necessary and sufficient 
condition for LASSO's model consistency |6|: 



37^ > : 



C2i{Cn)-\sign{^,)) 



<l-r] 



(10) 



Moreover, according to the Corollary 1 in [6 |, if the Elastic 
Irrepresentable Condition (EIC) is satisfied, the Elastic-Net 
model has model consistency by choosing right values for A, 
M, and a. The EIC is as follows: 



C2i{Cn + ^I)"^ ( sig<^i) + y 



<l-r] (11) 




Fig. 1: Comparison of probability that EIC holds in Elastic- 
Net and IC holds in LASSO in various number of random 
walks in the first dataset 

has used LASSO in its model. Moreover, in the theoretical 
proof of f5l, they required that the graph is highly connected 
which may not be true in all of the real networks. Choosing 



A VMlogM, A; = 8 as sparsity, and N 273 as the 
number of links in the first dataset, by increasing M such that 
M ^ oo, Figd] shows that in CSCD model model consis- 
tency holds with higher probability. 

The rest of this section shows the evaluation of F-Score 
in our model in various settings. Fig. [2] illustrates an im- 
provement of the F-Score performance of the CSCD model 
by an average of 5% (Dataset 1) and 10% (Dataset 2) for the 
various number of random walks compared to the Compres- 
sive Sensing over Graphs method (CS-over-Graph) presented 
in 0. The number of random walks changes from 10% to 
90% of the total network links. Although, we have evaluated 
F-Score for various random walk steps, the results were sim- 
ilar to those shown in Fig. [J] Clearly, for lower number of 
random walks, we have lower number of measurements, and 
thus less samples. Since the proposed model simultaneously 
employs the prior knowledge based on the link betweenness 
centrality, it performs better than CS-over-Graphs model flU 
where only the sparsity information (||x||o) is used. How- 
ever, as the number of random walks grows, our F-Score gets 
closer to CS-over-Graphs'. Because at higher number of ran- 
dom walk the prior knowledge becomes less significant. As 



random walks when F-Score equals to 50%. The number of 
random walks in CSCD is decreased by an average of 16% in 
the first dataset and 15% in the second one compared to CS- 
over-Graphs. We also compare the F-Score of our model with 




CSCD in Dataset 1 ( F-Score = 50% ) 

CS over Graphs in Dataset 1 ( F-Score = 50% ) 

CSCD in Dataset 2 ( F-Score = 50% ) 

CS over Graphs in Dataset 2 ( F-Score = 50% ) 
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Fig. 4: Sparsity versus various number of random walks in 
two datasets 



the algorithm that is used only Betweenness Centrality (BC) 
measurement for congestion detection (BC algorithm) as em- 
ployed in (a, in, and |[T0|. The result is illustrated in Fig. \5\ 
Since betweenness centrality is independent from the number 




CSCD for Datasetl (Sparsity =14) 
■■+" CS over Graphs for Datasetl (Sparsity =14) 
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'"•""CS over Graphs for Dataset2 (Sparsity =37) 
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Fig. 2: F-Score versus random walk number in two datasets 

illustrated in Fig. [S] we have also evaluated the F-Score of 
CSCD in terms of sparsity of the congested links in the net- 
work (||x||o). The F-Score of the CSCD model is improved 
by an average of 11% in the first dataset, and 9% in the sec- 
ond one compared to CS-over-Graphs model. In Fig. |4] we 
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Fig. 3: F-Score versus sparsity in two datasets 

have measured the required number of random walks in terms 
of the sparsity of the congested links in the network (||x||o). 
Considering the sparsity varies from 5% to 30% of the net- 
work links, we have calculated the least required number of 
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Fig. 5: Comparison of our model with BC algorithm in terms 
of (a) the required number of random walks (b) sparsity 



of random walks, it remains constant in Fig. |5(a)[ As shown 
in Fig. [21 CSCD outperformed the algorithms based on mere 
betweenness centrality, by an average of 12% on F-Score for 
different sparsities and 69% on F-Score for various random 
walk numbers. 



6. CONCLUSION 

In this paper, we introduced a new objective function based on 
the concepts of compressive sensing in a network tomography 
application. We used the link betweenness prior knowledge 
in our objective function which results in a decrease on the 
required number of measurements for detecting the network 
congested links. Based on extensive simulation results, we 
verified significant improvement in accuracy of detecting the 
network congested links in two real datasets. 
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